Improving Text Analysis Using Sentence Conjunctions and Punctuation
نویسندگان
چکیده
منابع مشابه
Bilingual Sentence Alignment Based on Punctuation Marks
We present a new approach to aligning English and Chinese sentences in parallel corpora based solely on punctuations. Although the length based approach produces high accuracy rates of sentence alignment for clean parallel corpora written in two Western languages such as French-English and German-English, it does not fair as well for parallel corpora that are noisy or written in two distant lan...
متن کاملImproving Text Segmentation Using Latent Semantic Analysis
Choi, Wiemer-Hastings and Moore (2001) proposed to use Latent Semantic Analysis to extract semantic knowledge from corpora in order to improve the accuracy of a text segmentation algorithm. By comparing the accuracy of the very same algorithm depending on whether or not it takes into account complementary semantic knowledge, they were able to show the benefit derived from such knowledge. In the...
متن کاملSemantic and Layout Properties of Text Punctuation
Higher-level graphical and lexical punctuation (paragraphing, indentation, font changes, ...) must be taken into consideration in comprehension processes and text generation. In this paper, we analyse a class of text punctuation marks which includes lexical units (chapter, introduction). We give a method for the analysis of the semantics of these units, in terms of metalanguage. In addition, th...
متن کاملText punctuation and prosody in Greek
A production experiment was carried out, in order to investigate text punctuation, including standard as well as ungrammatical (communicative) punctuation marks, and prosody relations. It is shown that punctuation is directly related to the duration of pauses, leading to the following structure: question mark>exclamation mark>full stop> colon>comma> ellipsis. Pitch resetting occurs in all cases...
متن کاملImproving Quality of Vietnamese Text Summarization Based on Sentence Compression
Sentence compression is a valuable task in the framework of text summarization. In previous works, the sentence is reduced by removing redundant words or phrases from original sentence and tries to remain information. In this paper, we propose a new method that used Grid Model and dynamic programming to calculate n-grams for generating the best sentence compression. These reduced sentences are ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Marketing Science
سال: 2020
ISSN: 0732-2399,1526-548X
DOI: 10.1287/mksc.2019.1214